Estimation of qubit pure states with collective and individual measurements 
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We analyze the estimation of a qubit pure state by means of local measurements on A'^ identical 
copies and compare its average fidelity for an isotropic prior probability distribution to the absolute 
upper bound given by collective measurements. We discuss two situations: the first one, where 
the state is restricted to lie on the equator of the Bloch sphere, is formally equivalent to phase 
estimation; the second one, where there is no constrain on the state, can also be regarded as the 
estimation of a direction in space using a quantum arrow made out of A*' parallel spins. We discuss 
various schemes with and without classical communication and compare their efficiency. We show 
that the fidelity of the most general collective measurement can always be achieved asymptotically 
with local measurements and no classical communication. 

PACS numbers: 03.67.-a, 03.65.Wj, 89.70.+C 



I. INTRODUCTION 

In Quantum Information, measurement and estimation 
are not just some important topics. They are at the very 
core of the theory. An unknown quantum state can only 
be unveiled by means of measurements, and the infor- 
mation is always gained at the expense of destroying the 
state. This information is then processed to obtain the 
desired estimate. 

Any estimation procedure requires a sample of iden- 
tical copies of the unknown state on which we can per- 
form measurements. If unknown states could be copied, 
then one could produce samples of an arbitrary number 
of copies and the state could be estimated with infinite 
accuracy. The no-cloning theorem, however, prevents 
this possibility 0. But even in such unphysical circum- 
stances, one would only have finite time and a limited 
number of resources for copying and measuring. So, in 
the real world only a finite, usually not large, number of 
copies are available and only a reasonable approximation 
to the unknown state can be made. It is thus very im- 
portant to devise strategies with optimal performance in 
a variety of practical situations. 

Over the last few years, it has been recognized that 
a joint measurement on N copies is more efficient than 
N individual measurements on each copy separately. We 
will often refer to the former as collective, in contrast to 
individual (also called local) measurements. The quan- 
tum correlations behind the collective measurements are 
(almost) always more powerful than the classical correla- 
tions used in sequential individual measurements 0, Q ■ 
This and other issues have been studied in various con- 

texts i a i 0, a ilia, mill El El El El Ea, but 

for local measurements not rnany analytical results have 
been obtained 13, lo', "231, l2l", '2Z "2?, •2li . The most pow- 
erful, involve sophisticated estimation theory technology 
that is not so widely known among physicists. More- 
over, they mainly apply in the asymptotic limit (when 
N is large), and in a "pointwise" fashion, in the sense 
that no average over the prior probability distribution of 
unknown states is considered |2a, |2« . 



Here we address the issue of estimating the most ele- 
mentary quantum pure state, that of a qubit, assuming 
we have a sample of N identical copies of it. We consider 
two relevant cases: estimation of a completely unknown 
qubit state, i.e. one given by an arbitrary point on the 
Bloch sphere; and estimation of restricted states that are 
known to lie on the equator of the Bloch sphere. The 
latter is also interesting because it is formally equivalent 
to phase estimation (these states are also equivalent to 
the so-called rebits ,27J). We refer to these two situations 
as 3D and 2D cases, respectively. 

The most general measurement is described by a Posi- 
tive Operator Valued Measure (POVM) on the TV copies. 
An optimal measurement of this type yields the ultimate 
bounds that can be achieved by any estimation proce- 
dure. We will re-derive these bounds in a unified, very 
comprehensive framework. For local measurements, we 
will consider von Neumann measurements, as these can 
be readily implemented in a laboratory. So, by "local 
measurement" we will loosely mean a local von Neumann 
measurement. Furthermore, we will show that for some 
of the local procedures discussed here, optimal measure- 
ments are necessarily of von Neumann type. We will 
refer to local schemes that use classical communication, 
i.e. those that exploit the possibility of using the actual 
outcomes that are being obtained to dynamically adapt 
the next measurements, as LOCC (Local Operations and 
Classical Communication) schemes. 

Let us be more concrete about the problem we are 
concerned with. Assume that we are given an ensemble 
of N identical copies of an unknown qubit state, which 
we denote by |n), where n is the unit vector on the Bloch 
sphere that satisfies 



|n)(n| 



1 + n • (T 



(1) 



and a = (ca; , cry , cr^ ) are the usual Pauli matrices. After 
performing a measurement (collective or local) on the N 
copies of |n), one obtains an outcome x- Based on Xi 
an estimate, |M(x)), for the unknown state is guessed. 
To quantify how well |M(x)) approximates the unknown 
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state \n) we use the fidelity, defined as the overlap 



/„(x)^|(n|M(x))P = 



1 + n- M (x) 



(2) 



The fidelity (0) can be regarded as a "score": we get 
"1" for a perfect determination {M = n) and "0" for 
a completely wrong guess (M — — n). Our aim is to 
maximize the average fidelity, hereafter fidelity in short, 
over the initial probability and all possible outcomes. 



F={f)=Y,J dnUx) Pnix), 



(3) 



where dn is the prior probability distribution, and Pn{x) 
is the probability of getting outcome x given that the 
unknown state is \n). 

As mentioned above, we allow for general measure- 
ments or POVMs. They are defined by a set of posi- 
tive operators {0{x)} (each one of them associated to 
an outcome) that satisfy the condition 



(4) 



The probability p„(x) is given in terms of these operators 

by 



Pnix) = tr[p„0(x)]. 



(5) 



where p„ is the quantum state of the N identical copies, 
i.e., p„ = {\n){n\r^. 

In Eq. there are only two elements that require 
optimization: the guess and the POVM, which enter this 
equation through |2Jl and jSJ, respectively. The optimal 
guess (OG) can be obtained rather trivially. The Schwarz 
inequality shows that the choice 



Mix) 



vix) 
\vix)l 



where 



Vix) = / dnnpnix), 



(6) 



(7) 



maximizes the value of the fidelity, which then reads \2z 



F = -il 
2 ^ 



A) = - 
' 2 



(8) 



(in the strict sense we should write -Fog- but to sim- 
plify the notation, we drop the subscript when no confu- 
sion arises). Eq. ^ gives the best state that can be in- 
ferred, and Eq. JSj) gives the maximum fidelity that can 
be achieved for any prior probability and any measure- 
ment scheme specified by the conditional probabilities 
Pnix)- We are thus left only with the non-trivial task 
of obtaining the optimal measurement. The goal of this 



paper is to compute the maximum value of © within 
a unified framework for various measurement schemes, 
specially the local ones. 

The paper is organised as follows. In the next section 
we derive the bounds on the fidelity of the optimal col- 
lective measurements. In Sec. lIIII we discuss several local 
measurement schemes, with and without classical com- 
munication. Asymptotic values of the fidelity are com- 
puted in Sec. IIVI A summary of results and our main 
conclusions are presented in Sec. ]V\ and two technical 
appendices end the paper. 



II. COLLECTIVE MEASUREMENTS 

A. 2D states 

A 2D state corresponds to a point that is known to lay 
on the equator of the Bloch sphere. If we take it to be 
on the xy plane, such state, \n), has n = (cos 0, sin 0, 0). 
If no other information is available, the prior probability 
distribution has to be isotropic, that is, dn = d0/(27r). 
Eq. ijHl then reads 



^ = E / |^^tr[p„0(x)] 
Notice that we can write, 

Pn = pi9) = uie)poU\e), 



(9) 



(10) 



where po is a fiducial state of angular momentum J = 
N/2 and maximal magnetic quantum number, m = J, 
along any fixed direction on the equator of the Bloch 
sphere. In particular, po can be chosen to point along 
the X axis, 



Po = \JJ)xx{JJ\- 



(11) 



In Eq. Hl()|l . 17(6') is the unitary representation of a ro- 
tation around the z axis. The group of such unitary 
matrices is isomorphic to ?7(1). In the standard basis 
\jm) = |m), where Ui9) is diagonal, we have 



p(0) = ^e'("-")'' (po), 

m n 

We are now ready to compute ||2J): 



|to)(?i| 



(12) 
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iPo)mn [Oix)]r 



iPo) 



-1 [Oix)] 



m+1 m 



(13) 



where [0(x)]mn = ('7t|0(x)|'^)- The following inequali- 
ties give the maximum value of A: 



A < E E \(po) 

X m= — J 



.+1 [Oix)] 



m+1 m 
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j-i 



< l(po)„™+iiXli['^(^)]"+i' 

m= — J X 

< E \ipo) mm+l I ' 



m— — J 



where in the last step in (|14|1 we have used that 



(14) 



(15) 



as follows from positivity and Q. More precisely, posi- 
tivity implies 

[0{x)Um [0{X)U+1 rn+1 > \ [0{x)]mm+l \\ (16) 

and the Schwarz inequality yields 



< 



lY.l(^ix)Un.JJ2[0ix)]m+lm+l = 1- (17) 



There are two points worth emphasizing here. First, 
all the inequalities can be saturated, therefore the bound 
is tight (we give below a POVM that accomplishes this 
task). Second, the bound (|14|l is completely general. If 
we were interested in encoding the information carried by 
a phase in a covariant way [as in (|10|l ] , but using a gen- 
eral fiducial state [lllllla, Po = Emm' ama*^,\m){m'\ 
(ideally the best possible one, which is not necessarily a 
tensor product of identical copies) , the fidelity would still 
be bounded by ffT^i . In this general case, one has A < 

Emm' O'm'^m'mam, whcrC 2Mm'm = <^m' m+1 + <^m' + lm, 

and its maximum value is given by the largest eigenvalue 
of the matrix M . A straightforward calculation gives 
Fniax = (l + cos[27r/(d.7 + 2)])/2, where dj = 2j + l is the 
dimension of the invariant Hilbert space corresponding 
to the representation j of SU{2). 

In our problem po is constrained by the condition of 
having identical copies. From Hll|l and (|14|l one has @ 



A < 




N \ / N 
J + m ! \ J + m + 1 



2N ^ \J + m J \J J + m + V 

m— — J 



J — m 



(18) 



where we have used that 



2'J ^ \ J 

m— — J 



N 



\Jm) (19) 



(recah that J = A^/2). 

We next show that there are POVMs that attain this 
bound. To saturate the first inequality in l|14|) the phase 



of [0(x)]mm+i must be independent of to. This is en- 
sured if this phase is a function of m — n. Similarly, a 
set of positive operators for which |0(x)mn| = constant 
for all X, TO and n, will certainly saturate the remaining 
inequalities in l|14() . In particular, the covariant (contin- 
uous) POVM, whose elements are given by 



— J^{ra-n) 
mn — c 



(20) 



satisfies all the requirements. Note that we have labeled 
the outcomes by a rotation angle 0, which plays the role 
of X- Hence, condition Q becomes 



2tt 



0{^) = 1, 



(21) 



which certainly holds for H2(J|I . These are rank one oper- 
ators, and can also be written as 



where 



0{c^) = UmB){B\U^{4'), 



\B)= |J,m). 

7ri— — J 



(22) 



(23) 



We have just shown that at least one optimal mea- 
surement [i.e., a POVM that saturates 1)14(1 ] exists, but 
other optimal measurements can be found. POVMs with 
finite number of outcomes, for instance, are straightfor- 
ward to obtain by choosing (j) to be the dj-th roots of 
unity, namely. 



[om 



van — , exp 

do 



, 27rfc 
iym — n)— — 
dj 



(24) 



where k = 1, . . . ,dj. In this case {0(fc)} is a von Neu- 
mann measurement, since the number of rank one POVM 
elements equals the dimension of the Hilbert space. 

In the asymptotic limit the fidelity can be obtained in 
terms of the moments of a binomial distribution Bin(n, p) 
with parameters n = N and p = 1/2. We simply need 
to expand (|18|) in powers of to, i.e., around (m) = 0, to 
obtain 



A < 



2^ E 

n 

2m 



N 
J + m 



(25) 



1 - 



N 



2m^ 



- 1 ] + 0(l/7V3/2) 



(notice that sum over to is shifted by J with respect 
to the usual binomial distribution). The moments are 
well known to be (1) = 1, (to) = and (to^) = N/A. 
The latter shows that m has "dimensions" of Vn, which 
helps to organise the expansion in powers of 1/A^. We 
finally obtain 

A,,,. = 1 - -L + . . . , (26) 



2N 



and a fidelity 



F = 1- 



AN 



(27) 
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B. 3D states 

A 3D state \n) corresponds to a general point on the 
Bloch sphere. As in the previous section we write 



Pn = p{n) = U{n)poU^n), 



(28) 



where for convenience po is now chosen to point along 
the z axis, i.e., 



Po = \JJ){JJl 



(29) 



and U{n) is the unitary representation [i.e., the element 
of SU{2)] of the rotation that brings z into n (a rotation 
around the vector zxn). 

Recalling © and Q, we have 

Vix) = J dnntr[p„0(x)], (30) 

where dn is the invariant measure on the two-sphere, e.g., 

d{cos 6)d(j) 



dn 



An 



(31) 



where and (f> are the standard azimuthal and polar an- 
gles. Notice that we can always define an operator ri(x) 
in such a way that 



0{x) ^ U[Mixmix)U^Mix)], 



(32) 



where M{x) is given by ©. Taking into account that A 
is rotationally invariant one obtains 



E 



dn n^ tr [pn^ix) 



We readily see that n^ = cos 6* = !DQQ^(n), where the 
rotation matrices are defined in the standard way, 

®mL'("') — {im\U{n)\im'). We then have 

X mm' 



(33) 



mix 

where in the second equality we have used that 



(34) 



(35) 

as follows from Schur's lemma after realizing that the left 
hand side of H35|l commutes with U (6), the unitary trans- 
formations defined right after Hll|l . Recall that these 
transformations, which are a U{1) subgroup of SU{2), 
have only one-dimensional irreducible representations, la- 
beled by the magnetic quantum number to, thus yielding 
relation H35|) . In (|34|l we have removed the absolute value 



as all terms are positive (see below). Tracing (|32() one ob- 
tains ^^trn{x) = ^^^^ 0{x) = dj. Therefore 



A < dj max / dn'Sr}Q{n) {pn) 

m J 

= max(10; Jto| Jto)^ — 



J+1' 



(36) 



where in the second equality we have used that pmm = 

®m /('"') ®m J* (^) ^'^'^ ^^"^ ^^^1 known orthogonality rela- 
tions of the SU(2) irreducible representations Re- 
calling that J = N/2 we finally get jsj 



F = 



N +1 



iV + 2' 

which for large N behaves as 

N 



(37) 



(38) 



Let us finally give a POVM that saturates all the 
inequalities. The maximum of the Clebsch-Gordan 
(10; Jm|Jm) in H36|l occurs at m = J. Hence, to attain 
the bound we need to choose 



^mm(x) — ^ ^ dj 



(39) 



where the coefficients Cy^ are positive. This leads straight- 
forwardly to the optimal continuous POVM defined by 



0{m) = dj U{7fi)\JJ){JJ\U\ffi) 



(40) 



(one can check that j dm 0{fh) — 1). POVM's with 
finite number of elements can also be constructed. The 
only requirements are and, of course, Q). These 
constraints translate into a series of conditions for the 
set of directions {to}, for which solutions with a finite 
number of elements can be found. We refer the reader to 
the hterature HH mill for details. 



III. LOCAL MEASUREMENTS 

Collective measurements, although very interesting 
from the theoretical point of view, are difficult to imple- 
ment in practice. Far more interesting for experimental- 
ists are individual von Neumann measurements |2lL l3lj | . 

Individual von Neumann measurements on qubits are 
represented by two projectors 



0(±to) 



1 ± TO • tJ 



(41) 



where to is a unit Bloch vector characterizing the mea- 
surement (in a spin system, e.g., to is the orientation of 
a Stern-Gerlach). In a general frame we must also allow 
classical communication, i.e., the possibility of adapting 
the orientation of the measuring devices depending on 
previous outcomes 
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In the next sections we study quantitatively several 
schemes in ascending order of optimality: from the most 
basic tomography inspired schemes [2flll3 the most 
general individual measurement procedure with classical 
communication p^. 

Our aim is to investigate how good these local mea- 
surements are as compared to the collective ones. We 
would like to stress that in this context few analytical 
results are known 0, . Our results here comple- 

ment and extend the analysis carried out by some of the 
authors in p^ . 

A. Fixed Measurements 

Let us start with the most basic scheme for recon- 
structing a qubit: fixed von Neumann measurements 
along 2 orthogonal directions (say, x and y) in the equator 
of the Bloch sphere for 2D states, or along 3 orthogonal 
directions (say, x, y and z) for 3D states. This kind of 
scheme is often called tomography [sollsills^ . 

Consider N = 2M (3?^) copies of the state \n). After 
measurements along each one of the directions Ci, i = 
x,y,{z), we obtain a set of outcomes and —1 with 
frequencies Ka^ and ?vf(l — a^), respectively. This occurs 
with probability 

(42) 

where n.i are the projections of the vector n along each 
direction, rii = n • e^, and we have used the shorthand 
notation a = {a^}. The combinatorial factor takes into 
account all the possible orderings of the outcomes and 
the remaining factors are the quantum probabilities, i.e, 
the appropriate powers of tr [\n){n\0{±ei)]. 

Since the expectation value of a is {n\a\n) — ft, a 
straightforward guess based on the relative frequencies 
of each outcome is 

M^^^ia) = , '"-"^ , (43) 

V^E,(2«,-i)2 

where the superscript stands for central limit guess 
(CLG). Notice the normalization factor which ensures 
that |M'-^'"'~^| = 1, hence M'~^^'^ always corresponds to a 
physical pure state ■ The average fidelity in this case 
is given by 

F^\ + \Y. h""'^'- ^^'^''(«)P«(a), (44) 

where Pn{a) is defined in (021 ■ Although the CLG 
is not the best state that one can infer from the data, it 
has the nice property that it can be directly (and easily) 
obtained from the observed frequencies without further 
processing. So, it is interesting to compare its fidelity 
with the optimal one, given by {Tj) and ijHJl (see Fig.nj. 




10 20 30 40 50 60 

N 



FIG. 1: Average fidelities in terms of the number of copies 
in the 2D case for the optimal collective measurement (solid 
line), tomographic OG (dot-dashed line), tomographic CLG 
(dashed line) and a simulation of the greedy scheme (dots). 

The OG for this measurement scheme is M'-^'^{a) = 
V{a)/\V{a)l where 

V{a) = J dnnpnia), (45) 

and, from ((SJ, the optimal fidelity reads F = [1 -f 
'^^V{a)]/2. Closed expressions of the fidehty for the 
lowest values of N can be derived from (|44|l and H45|) 
using 

j dnni^n^^---ni^ = ^Si^i^^J,^,^---Si^_^^^y, (46) 

where the normalization factor is Kq — ql\ in 2D and 
Kg = {q + 1)!! in 3D, and the indexes in curly brack- 
ets are fully symmetrized, e.g., 6{i^iJ^^^^y = '^nia'^iau + 
Si-iisSi^i^+Si-^i^Si^i^. Obviously, the integral H46|l vanishes 
for q odd. For larger values of N the expressions became 
rather involved and we have resorted to a numerical cal- 
culation. 

The 2D case is illustrated in Fig. ^ where the aver- 
age fidelity for the above two guesses and N in the range 
10— 60 is shown. In both instances the fidelity approaches 
unity as N increases, and the OG always performs bet- 
ter than the CLG, as it should. Notice that to make 
the graphs more easily readable we have interpolated be- 
tween integer points. 

At this point, it is convenient to define the scaled error 
function 

eN^N{l~F), (47) 

and the limit 

e = lim eN, (48) 

N^oc 

which gives the first order coefficient of the fidelity in the 
large N expansion, F = 1 — e/N + ■ ■ ■ (the asymptotic 
behavior will be properly discussed in Sec. lIVp . Fig. [21 
shows Eat as a function of N for 2D states. One read- 
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FIG. 2: Scaled error ejv = N{1 - F) in the 2D case for col- 
lective (solid line), OG (dot-dashed line), CLG (dashed line) 
and greedy (diamonds) schemes. 




30 40 

iV 



FIG. 3: Scaled error ejv — N{1 — F) in the 3D case for col- 
lective (solid line), OG (dot-dashed line), CLG (dashed line) 
and greedy (diamonds) schemes. 



ily sees that the CLG gives e^^^ « 3/8 while for 
collective measurements one has e'^OL ^ 1/4:, in agree- 
ment with Eq. |(23). The stability of the curves e^°^ 
and fS^'^ shows that the fidelity is well approximated by 
F = 1 — e/N for such small values of N as those in the 
figure. This asymptotic regime is not yet achieved by 
the OG, however we will show in Sec. II VI that the OG 
gives e*-"^ ^1/4, thus matching the collective bound for 
large N. 

Fig.|3|shows the scaled error in the 3D case. Again, one 
readily sees that the OG performs better than the CLG. 
However, the improvement does not seem to be enough 
to match the collective bound. We prove analytically in 
the next section that = 13/12 > e^°^ = 1. 

In previous paragraphs we have presented the most 
basic scheme, i.e., that with a minimal number of ori- 
entations of the measuring devices and without exploit- 
ing classical communication. A next step in complex- 
ity is to consider a more general set of fixed directions 
{mfe}. It is intuitively clear that, assuming some sort of 
isotropy, the more directions are taken into account, the 
better the estimation procedure will be. For instance, 
in 2D we may consider a set of directions given by the 



angles 6k = kir/N, where k = 1, . . . , N. The set of out- 
comes X can be expressed as an iV-digit binary number 
X — «Ar«Ar-i • • • 42*1, where ik (= 0,1) and the fidelity 
then reads 



F : 



2"-l 

E 

x=oo---o 



N 



dn n 



1 + {-lY'^ft- rhk 



k=l 



(49) 



Analytical results for low N can be obtained using H46|l . 
For large N, numerical computations show that this 
"isotropic strategy" is indeed better than tomography 
(see [33 for explicit results), but there is no substantial 
improvement. 

For 3D states, however, it is really important to con- 
sider more general fixed measurement schemes, such as 
a 3D version of the isotropic one we have just discussed. 
One can readily see from Fig. O that the tomographic 
OG does not saturate asymptotically the collective bound 
and one could be tempted to think that classical commu- 
nication may be required to attain it ,19,, 22] . That would 
somehow indicate a fundamental difference between the 
estimation of 2D and 3D states. 

There is a difficulty in implementing the isotropic 
scheme for 3D states since the notion of isotropic distri- 
bution of directions is not uniquely defined, which con- 
trasts with the 2D case. A particularly interesting scheme 
that encapsulates this notion (at least asymptotically) 
and enables us to perform analytical computations con- 
sists of measurements along a set of random directions. 
With the same notation as in 149(1 . the fidelity for this 
set of directions can be written as 



2™-! N 
X=Q---0'^ k=l •' 

(50) 

In Fig. 0] we show the scaled error, e^v, obtained from 
numerical simulations for rather large N . One readily 
sees the improvement of the random scheme over the to- 
mographic OG. We will show in the next section that 
the former indeed attains the collective bound asymp- 
totically, thus resolving the puzzle of whether classical 
communication is needed or not. A numerical fit gives a 



^rand 



1.002 ± 0.008, which provides a numerical 



value 

check of the analytical results of section ITVl below. 



B. Adaptive Measurements 

In this subsection we will discuss schemes that make 
use of classical communication. In principle, they should 
be more efficient than those considered so far. 



1. One step adaptive 

We first review a method put forward by Gill and Mas- 
sar 113] , which we call "one step adaptive" . This scheme. 
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FIG. 4: Scaled error eat of the random scheme (triangles) 
as compared to the OG (dot-dashed line) and the optimal 
collective scheme (solid line). 



although very simple, suffices to show in a very straight- 
forward way that local measurements attain the collec- 
tive bounds in 2D and 3D. It also has the nice feature 
that only one reorientation of the measuring device is 
required. 

The basic idea of the method is to split the measure- 
ments in two stages. In the first one a small number of 
copies is used to obtain a rough estimate Mq of the state. 
In the second stage the remaining copies are used to re- 
fine the estimate by measuring on a plane orthogonal to 
Mq. This strategy has a clear motivation from the in- 
formation theory point of view. A measurement can be 
regarded as a query that one makes to a system. The 
most informative queries are those for which the prior 
probabilities of each outcome are the same. Measure- 
ments on the orthogonal plane to Mq have this feature. 
The method turns out be efficient if the number of copies 
used in each of the two stages is carefully chosen. 

To be more concrete, suppose we are given copies 
of an unknown qubit state. Let Nq stand for the number 
of copies used in the first stage and let iV = iV — A^o 
stand for the rest. In the 2D (3D) case, one measures 
No/2 {No/ 3) copies along two (three) fixed orthogonal 
directions and infers the guess Mq. In the second stage, 
one measures A" {N/2) along u {u,v), which are chosen 
so that {u, V, Mo} is an orthonormal basis, and infers the 
final guess M. If Nq cx N°- with < a < 1, one can 
show that the one step adaptive scheme saturates the 
collective bound in the asymptotic regime T9]. Although 
we do not have a rigorous proof, our numerical analysis 
reveals that the optimal value of a, i.e., the one that 
gives the maximal fidelity, is a ~ 1/2. For other choices 
of a the scheme can even be less efficient than some fixed 
measurement schemes. For the benefit of the reader, we 
present a detailed discussion of the method and a proof 
of the asymptotic limit within our unified framework in 
Appendix ^ 



2. Greedy scheme 

We now move forward to more sophisticated schemes 
and discuss one that exploits much more efficiently clas- 
sical communication. The idea behind it is to maximize 
the average fidelity at each single measurement step. It is 
called "greedy" because it does not take into account the 
total number of available copies, instead, it treats each 
copy as if it were the last one [23, |21| . 

We ffi'st need to introduce some notation. Recall that 
the set of outcomes x can be expressed as a A^-digit bi- 
nary number x = iN^N-i ■ • • ^2^1 (ife — 0, 1). Since we 
allow the fc-th measurement to depend on the list of previ- 
ous outcomes, ik-iik-2 ■ ■ ■ *2«i = Xk (note that x — Xn), 
we have m(xfc) instead of fhk. This is a compact no- 
tation where the length k of the string Xk denotes the 
number of copies upon with we have already measured. 
The orthogonality of the von Neumann measurements is 
imposed by the constraint 



m{lxk-i 



-m(Oxfe_i 



(51) 



where lxfe_i is the list of length k obtained by prepend- 
ing 1 to the list Xk-i, and similarly for Oxk-i- In gen- 
eral, the number of independent vectors for a given A^ 
is (Ef=i 2'')/2 = 2^-1. For example, if A^ = 2 there 
are three independent directions, which can be chosen as 
m(0), to(OO), to(01), and the other three are obtained us- 
ing Eq. (|51|l . Since the ffist measurement can be chosen 
at will, this number is reduced to 2^ — 2. 

The general expression of the conditional probability 
thus reads 



N 



pnix) = n 



l-f n • Th{xk) 



(52) 



fc=i 



and, as discussed in the introduction, the OG gives a 
fidelity = (1 A)/2, where 



A = 



2"-l 

E 

x=oo---o 



dnnpn{x) 



(53) 



We could in principle attempt to maximize this expres- 
sion with respect to all the independent variables, i.e., 
all independent {w(xfc)}- However, the maximization 
process very quickly becomes extremely difficult. In the 
greedy scheme one takes a more modest approach: one 
maximizes at each step k. This enables us to find a com- 
pact algorithm for computing the fidelity, as we discuss 
below. Furthermore, we show in Appendix B that in this 
situation the optimal local measurement at each step is 
indeed of von Neumann type, i.e., any other POVM will 
perform worse. 

Let us concentrate on the last step, A^, of the greedy 
scheme. Suppose we have optimised the previous A^ — 1 
measurements and have obtained a string of outcomes 
Xn-i- To ease the notation, let us denote the direc- 
tion of the last measurement by mTv, namely rfiN = 
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m(0x7V-i) = ^"^(IXA^-i)- We then need to maximize 
diXN) = \V{OxN~i)\ + \V{lxN-i)\. (54) 

Here 

V{iNXN-ij = / dnnpnixN^ij ^ 

or, equivalently, 

V{^NXN-l) = ^["t^(XJV-i) + (-l)^"A(xjv-i)mAr], (56) 

where A is the real positive symmetric matrix with ele- 
ments 

^kiiXN-i) = dnnkniPn{xN~i)- (57) 



Therefore 

d{XN) 



^[\V{xN-i) + HXN-i)mN\ 
\VixN-i) - A(xw-i)tojv|| • 



(58) 



Notice that for 2D states and fixed d{xN) the points jl — 
Am^v lay on an ellipse with focus at ±V (an ellipsoid 
for 3D states). In addition they fulfil the normalization 
constraint 



aI-(a-V)-i, 



(59) 



which also defines an ellipse (ellipsoid in 3D) centered at 
the origin. As usual, optimality tells us that the maxi- 
mum of d(xjv) occurs at the points of tangency of the el- 
lipses (ellipsoids). This provides a geometrical procedure 
for finding the optimal direction m^r and an algorithm 
for computing |V^(x7v)|. 

We now proceed to obtain some explicit expressions 
for low N. We discuss only the 3D case, as the 2D case 
is completely analogous (numerical results for 2D states 
are shown in Fig. and Fig. [21). 

When we only have one copy of the state, = 1, the 
Bloch vector of the measurement can be chosen in any 
direction, say e^^ i.e., m(0) = — m(l) = e^- The explicit 
computation of the vector in 10 gives 



^(Xi) = ^™(xi). 



(60) 



and = 2/3, as expected from (EZI) [or (fH^ ]. 

The first non-trivial case is iV = 2. The matrix A(xi) 
reads 



Afei(xi) ^4; 



Xi =0,1, 



(61) 



i.e., A(xi) is independent of xi and proportional to the 
identity. The maximum of H58|l occurs for 1x12 -L Sx, so 
we choose rn2 = Cy, which means, to(OO) = to(01) = Cy, 
(notice that in general these two vectors do not need to be 



equal, they are only required to be orthogonal to m(0)]. 
Because of H51f) . we also have m(10) = m(ll) — —By. 
The OG reads 



m(2)(x) 



m{x2) + m{xi) 
V2 



e.g., M(2)(oi) 

m{0)]/^/2=[ey 



= [m(01) + m(l)]/\/2 = 
6*2;] /-v/S. One obtains 



\V{X2)\ 



V2 
12 



(62) 
[m(01) - 



(63) 



for all X2j which implies 



3 + V2 
6 ■ 



(64) 



The case N = 3 can be computed along the same lines. 
One can easily see that m(x3) has to be perpendicular 
to m{x2) and to(xi). This shows that, up to iV = 3, the 
greedy approach does not use classical communication, 
i.e, the directions of the measuring devices are only re- 
quired to be mutually orthogonal, independently of the 
outcomes. The optimal guess is a straightforward gener- 
alization of H62|) : 



M(3)(x) = 



mjxa) + ■m{x2) + m{xi) 
V3 



and the fidelity reads 



a/3 



6 



(65) 



(66) 



The above results could have been anticipated. As 
already mentioned, the outcomes of a measurement on 
the plane orthogonal to the guess have roughly the same 
probability and are, hence, most informative. One can 
regard these measurements as corresponding to mutually 
unbiased observables, i.e., those for which the overlap 
between states of different basis (related to each observ- 
able) is constant [s^. Hence, there is no redundancy in 
the information about the state acquired from the differ- 
ent observables. This point of view also allows to extend 
the notion of (Bloch vector) orthogonality to states in 
spaces of arbitrary dimension. 

The case A'^ = 4 is even more interesting, since four 
mutually orthogonal vectors cannot fit onto the Bloch 
sphere. We expect classical communication to start play- 
ing a role here. Indeed the Bloch vectors m(x4) do de- 
pend on the outcomes of previous measurements. They 
can be compactly written as 0| 



to(x4) = 



(-1)- 

V2 



2 

E 

k=l 



m{xk) X m{x3) 



(67) 



Again, one can see that the vectors m(x4) are orthogonal 
to the guess one would have made with the first three 
measurements. The fidelity in this case is 



15 



30 



(68) 
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For larger TV, we have computed the fidelity of the 
greedy scheme by numerical simulations. In Fig|31(Fig.n 
and Fig. 13 for 2D states) we show the results for 10 < 
iV < 60 (diamonds). Notice that the greedy scheme is 
indeed better than fixed measurement schemes and ap- 
proaches the collective bound very fast. 

Actually, the greedy scheme is the best we can use 
if we do not know a priori the number of copies that 
will be available; obviously, the best one can do in these 
circumstances is to optimise at each step. However, if N 
is known, we have extra information that some efficient 
schemes could exploit to increase the fidelity. We next 
show that this is indeed the case. 



3. General LOCC scheme 

In the most general LOCC scheme one is allowed to 
optimise over all the Bloch vectors {Tn{xk)}i thus taking 
into account the whole history of outcomes. Up to iV = 3 
the results are the same as for the greedy scheme: orthog- 
onal Bloch vectors for the measurements and no classical 
communication required. The results (|64|l and (|66|l are, 
therefore, the largest fidelity that can be attained by any 
LOCC scheme. 

The most interesting features appear at = 4. Here 
there are 14 independent vectors, which can be grouped 
into two independent families of seven vectors. With such 
a large number of vectors an analytical calculation is too 
involved and we have resorted partially to a numerical op- 
timization. The solution exhibits some interesting prop- 
erties. First, one obtains that rh{xi) -L ?7i(x2), for all 
Xi and X2, as in the N ~ 2 and N — 3 cases. Therefore 
one can choose to(xi) — {—ly^Cx and m(x2) = {—^Y^ey 
Only for the third and fourth measurement one really has 
to take different choices in accordance to the sequence of 
the preceding outcomes. The Bloch vectors of the third 
measurement can be parametrized by a single angle a as 

(-1)*-^to(x3) = cosaMi(x2) + sinQ;i;i(x2), (69) 
where 



where 



Ul(X2) 
^^l(X2) 

S{X2) 



m(xi) X m(x2), 
Ul{X2) X s(x2), 
"i(x2) + m(xi) 
V2 



(70) 



Notice that, rather unexpectedly, m(xi), w(x2) and 
to(x3) are not mutually orthogonal. The optimal value 
of this angle is a = 0.502. Although we cannot give any 
insight as to why this value is optimal, in agreement with 
our intuition one sees that m{x3) -L M(x2), i.e., the third 
measurement probes the plane orthogonal to the Bloch 
vector one would guess from the first two outcomes [see 
Eq. (EH)]. The vectors of the fourth measurement can be 
parametrized by two angles, (3 and 7, as 



W2(X3) = s(x2) X m(x3), 

W2(X3) = cos/3m(x3) - sin/?s(x2) 



(72) 



The optimal values of these angles are /3 = 0.584, 7 



0.538, and the corresponding fidelity is 



(4) 



ral 



0.8206. 



This is just 1.5% lower than the absolute bound 5/6 = 
0.8333 attained with collective measurement, Eq. H37|l . 
Note that this value is slightly larger than the fidelity 
obtained with the greedy scheme, F^^l^^^^ > F^^edy = 



(15 -I- \/9T)/30 « 0.8180. The extra information consist- 
ing of the number of available copies has indeed been used 
to attain a larger fidelity. We conclude that for A^ > 3, 
it pays to relax optimality at each step, and greedy 
schemes [lO, |^ are thus not optimal. We would like to 
remark that if, for some reason, some copies are lost or 
cannot be measured, the most general scheme will not be 
optimal, since it has been designed for a specific number 
of copies. We have also computed the values of the max- 



imal LOCC fidelities for A^ = 5,6: F^ 



(5) 



0.8450 and 



(6) 



(-l)*''m(x4) = cos7U2(x3) + sin7t72(X3), 



(71) 



general ~ 0.8637. Bcyoud A^ = 6 the small differences 
between this and the greedy scheme become negligible. 



IV. LOCAL SCHEMES IN THE ASYMPTOTIC 
LIMIT 

The asymptotic expression of the fidelity enables us 
to compare different schemes independently of the num- 
ber of copies. If two schemes have the same asymptotic 
fidelity, it is justified to say that they have the same 
efficiency, and conversely. Here we will compute such 
asymptotic expansions. We will show that, asymptot- 
ically, classical communication is not needed to attain 
the absolute upper bound given by the maximum fidelity 
of the most general collective measurements. Some of the 
results presented in this section were obtained by two of 
the authors by explicit computations in (22] . Here we will 
use a statistical approach that relate the Fisher informa- 
tion I •37^ with the average fidelity F. This approach will 
greatly simplify our earlier derivations. 

We label the independent state parameters by the sym- 
bol ij. This symbol will refer to the two angles 9, (p for 
3D states: ij = {9, 0); and the polar angle 9 for 2D states: 
7] = 9. 

Assume that under a sensible measurement and esti- 
mation scheme (we mean by that a scheme that leads to 
a perfect determination of the state when A^ — > 00, i.e, 

F ^-^^ 1) the estimated state is close to the signal state, 
that is, their respective parameters ?7(x) and rj differ by 
a small amount. In this Section, a hat (') will always re- 
fer to estimated parameters, the fidelity /„(x), Eq. 10), 
will be denoted by fri{fl), and similarly, the probability 
Pnix) will be written as Prjix)- Note that the guessed 
parameters f]{x) are based on a particular outcome x- 
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This dependence will be implicitly understood when no 
confusion arises. 

The fidelity can be approximated by the first terms of 
its series expansion: 



1 d^f 



2 drjidfjj 



im-Vi)ifij (73) 



where we have used that friirj) = 1 and dfri/df}\fi^ri — 0. 
Averaging over all possible outcomes, we have 



« 1 



itr [H(,,)V(,7)], 



where 



(74) 



(75) 



is the "pointwise" fidelity, H(?7) is the Hessian matrix of 
fn{f]) at fy = ?7, and V(77) is the covariance matrix, with 
elements y^j{■n) = Y.^Pri{x){'ni - Vi){flj -Vj)- 

It is well known that the variance of an unbiased esti- 
mator is bounded by 



(76) 



the so called Cramer- Rao bound 0, , where the 

Fisher information matrix 1(77) is defined as 



drij 



(77) 



The conditional probability (x) regarded as a function 
of rj is called the likelihood function £(77) = Pnix)- It 
is also well known that the bound H76|l is attained by 
the maximum likelihood estimator (MLE) [s^, defined 
as if^^^ = argmax C{r}). Hence this bound is tight. 

A link between the Fisher information and the fidelity 
is obtained by combining (|71|l and (|7S1) , and noticing that 
H(77) is negative definite. We thus have 



1 H(?7) 



(78) 



to leading order and for any unbiased estimation scheme. 

The Fisher information is additive. This means that 

f 2") 

iX') P-n{.x)p'r)ix')^ which happens when we 
perform two measurements [say, {0(x)} ^-nd {O'(x')}] 
on two identical states, the Fisher information of the 
combined measurement is simply I^^^(r7) = 1(77) + I'(77). 
In particular, for N identical measurements, we have 
iW(ry) =iV I(?7). 

Finally, since the OG is a better estimator, and it is 
asymptotically unbiased, we must have 



FOG(7y)c.F---(7y) = l + — tr 



pMLE 



H(77) 



2N Urf) 



(79) 



to leading order, where the fidelities refer to an estima- 
tion scheme consisting of N identical measurements. Be- 
low we use Eq. (I79II to compute the asymptotic limits of 
the fixed measurement schemes discussed in Sec. IIII Al 

We would like to remark that the Cramer-Rao bound 
assumes some regularity conditions on the figure of merit 
and the estimators. These conditions are satisfied for the 
problem considered here, but the bound may not hold in 
more general situations such as the estimation of a mixed 
qubit state (see US])- 



A. 2D states 

The 2D case is rather simple because the states have 
just one parameter and the Fisher information is a sin- 
gle number. Moreover, any von Neumann measurement 
whose vector lies on the equator of the Bloch sphere per- 
formed on a 2D system has I = 1, as can be checked by 
plugging 



1 ± cosf 



.) 



(80) 



into l|77|l . where 9m is the polar angle of to, the direc- 
tion along which the von Neumann measurement is per- 
formed. Therefore, in 2D the Fisher information for a set 
of N measurements (identical or not) is I^^'' = N . 
Since the Hessian is 



H 



1 

2' 



(81) 



any sensible local measurement scheme on 2D states will 
yield 



m 



1 



AN 



(82) 



Note that this fidelity is independent of 6, so it coincides 
with the average fidehty F = / deF{e) /{2tt) = 1 - A/N. 

We recall that this fidelity is attained by the MLE, and 
hence also by the OG. We note that it coincides with the 
collective bound, Eq. 1)2 7|l . This implies that the collec- 
tive bound is attained by tomography, without classical 
communication. It is quite surprising that the most basic 
scheme, with measurements along two fixed orthogonal 
directions, saturates already the collective bound asymp- 
totically. 



B. 3D states 

The 3D case is more involved. The results shown in 
Fig. 13 hint that tomography does not saturate the col- 
lective bound. There are, however, other measurement 
schemes that do saturate this bound and still do not re- 
quire classical communication. We prove these two state- 
ments below. 
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1. Tomography 



2. Random scheme 



Tomography, as explained in Sec. IIII Al consists in 
measuring along three orthogonal directions on the Bloch 
sphere. We will concentrate on the OG estimator, or 
more precisely on the MLE, which is asymptotically 
equivalent. We do not discuss the CLG, Eq. (|^ . as 
it does not attain the collective bound even for 2D states 
(see Fig. 13 . 

We first compute the Fisher information. Consider a 
scheme that consists in repeating 'N times the following: 
take 3 copies of the state and perform a measurement 
along Cx on the first copy, along Cy on the second copy, 
and along on the third copy (recall that N = 33\f). 
These three von Neumann measurements can be regarded 
as a single measurement with 2^ possible outcomes la- 
beled by X = (xi,X2,X3), where = ±1- The probabil- 
ity of obtaining an outcome x is 



Pvix)= n ^(l + Xj"-ej)- 



(83) 



The Fisher information matrix 1(0, 0) of such elemen- 
tary measurement is obtained by substituting Eq. H83(l 
in Eq. (|77|l . Note that the Fisher information of this 
scheme is l'^^He,<j)) = 7^1(61,0). 
The Hessian of the fidelity is 



We now consider the so called random scheme i.e, a 
scheme in which measurements are performed along ran- 
dom directions chosen from an isotropic distribution. In 
contrast to tomography, which only takes into account 
three fixed directions, this scheme explores the Bloch 
sphere isotropically if a large number of copies is avail- 
able. Therefore one can expect that it will perform much 
better. 

This approach is equivalent to performing a covari- 
ant (continuous) POVM on each one of the copies sep- 
arately. Here, we instead regard it as von Neumann 
measurements and a classical ancilla, e.g., a "roulette", 
that tells us along which direction we measure. From 
this point of view the outcome parameters are given by 
X = (C, (u = cost?), tp), where and (p are the azimuthal 
and polar angles of the direction m(u, ip) of the measure- 
ment, and ^ = ±1 is the corresponding outcome. 

Let us compute the Fisher information for a specific 
state T] = {{v = cos 9), 4>). Since this strategy is isotropic, 
the pointwise fidelity F(r]) is independent of rj, and we 
conveniently choose ?] = ((« = 0),0) = 0. By the same 
argument, no average over rj will be needed: F — F{ri). 
The probability is given by 



Priix) = 



1 + ^ fi ■ Tn{u, (p) 





1 - cos 26 



(84) 



which turns out to be independent of 4>. With this, we 
obtain 



1 H 

T^tr 



1(6*, 0) 16M 
35 + 28 cos 20 + cos 46* - 8 cos 40 sin" , 
9 7 cos 26* - 2 cos 40 sin^ 9 



(85) 



Integrating over the isotropic prior probability dn, 
Eq. iPU, we obtain 



1 fdntr'^--^ 



Recalling ^ and N ^ S'N we finally get 

13 



12N 



(86) 



(87) 



As mentioned above, the collective bound F = 1 — 1/A^ 
is not attained by tomography. At this point, the ques- 
tion arises whether classical communication is necessary 
to attain this bound. We next show that this is not the 



and the Fisher information reads 

dudip^ ^dlnprjix) dlnprjix) 



C Li J 



drji drjj 



(89) 



The diagonal elements read 



v?du dip 



cos Lp 



2' 



(90) 



T ^ [ {l-u^)siv?ipdudip 1 

OTT^^^J 1 + ^Vl - M C0S(y9 2 

As for the off-diagonal elements, a straightforward calcu- 
lation gives 



W(O) = l0„(O) = O, 



(92) 



as one could expect, since gaining information on v does 
not provide information on and viceversa. The Fisher 
information matrix thus reads 



The Hessian of the fidelity is 
5V 



H.,(0) 



fl=0 



2 ■ 



(93) 



(94) 
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Finally, using (|79|l we obtain 



We conclude that asymptotically classical communica- 
tion is not required to saturate the collective bound: a 
measurement scheme based on a set of random directions 
does the job. 



V. CONCLUSIONS 

We have presented a selfcontained and detailed study 
of several estimation schemes when a number N of iden- 
tical copies of a qubit state is available. We have used 
the fidelity as a figure of merit and presented a general 
framework which enables us to treat collective as well as 
local measurements on the same footing. 

We have considered two interesting situations: that of 
a completely unknown qubit state (3D case) , and that of 
a qubit laying on the equator of the Bloch sphere (2D 
case). We have obtained the optimal measurements and 
maximum fidelities for the most general collective strate- 
gies. These results, although well known, were scattered 
in the literature and rederived several times. Here we 
have obtained them within a direct and unified frame- 
work. The solution in the 2D case is strikingly simple, 
and can be extended to the case of optimal covariant 
phase estimation. 

These collective schemes yield the ultimate fidelity 
bounds that can be achieved by any scheme, thus set- 
ting a natural scale of what is a good or a bad estima- 
tion. However, they require a joint measurement on the 
N copies, which is usually very difficult, if not impossi- 
ble, to be implemented in practice. The main part of this 
paper has therefore focused on measurements that can be 
implemented in a laboratory with nowadays technology: 
local von Neumann measurements. 

In the 2D case we have shown that, quite surprisingly, 
the most basic tomographic scheme, i.e, measurements 
along two fixed orthogonal directions with the adequate 
data processing (the OG), gives already a fidelity that 
is asymptotically equal to the collective bound. We have 
obtained this limit using the Cramer-Rao bound, which is 
particularly simple to compute in this situation. This re- 
sult is in agreement with the direct (and lengthier) com- 
putation presented in [2^ . 

For the 3D states, tomography, i.e. measurements 
along three fixed orthogonal directions, fails to give the 
asymptotic collective bound, even with the best data pro- 
cessing. The main reason of this failure is that the Bloch 
sphere is not explored thoroughly. We have considered 
an extension that is asymptotically isotropic: a series von 
Neumann measurements along random directions. We 
have proved that this scheme, which does not make use 
of classical communication, does saturates the collective 
bound. Hence, we conclude that in the large N limit, an 



estimation procedure based on local measurements with- 
out classical communication does perform as well as the 
most efficient and sophisticated collective schemes. 

We have also discussed local schemes with classical 
communication, i.e, schemes in which the measurements 
are devised in such a way that they take into account pre- 
vious outcomes. We have studied in detail the one step 
adaptive scheme of Gill and Massar which has very 
interesting features: only two measurement orientations 
are required, adaptivity is only used once, and the esti- 
mation is made by a CLG, which can be read off from 
the frequency of outcomes. The economy of resources 
in this scheme may raise doubts about its efficiency. In 
Appendix ^ we give a simple proof that for large N it 
indeed attains the collective bounds. 

We have also studied strategies that make a more in- 
tensive use of classical communication. In the greedy 
scheme optimization is performed at each measurement 
step [23, 13 • This scheme is the best approach one can 
take if the actual number of available copies is not known. 
We have given a geometrical condition for sequentially 
finding the optimal measurements and have proved that 
they have to be of von Neumann type (see App. IB|| . 
i.e. no general local POVM's will perform better in 
this context. We have illustrated the performance of the 
method with numerical simulations and have shown the 
behaviour of the optimal collective scheme is reached for 
very low values of N. This occurs for N as low as iV = 20 
in 2D and slightly above, N = 45, in 3D. 

In the most general scheme we see that up to TV = 3 
{N = 2 in 2D) there is no need for classical communi- 
cation: the optimal measurements correspond to a set of 
mutually unbiased observables. For larger TV, the knowl- 
edge of the actual value of N provides an extra informa- 
tion that translates into an increase of the fidelity. From 
the practical point of view, however, this difference is 
negligible already at the level of a few copies (A^ > 6). 

Our approach may be extended to other situations. For 
instance, the problem of estimating (qubit) mixed states, 
which is much more involved, can be tackled along the 
lines described here ||2J|. It would also be interesting to 
consider qudits and check whether a set of mutually un- 
biased observables provides the optimal local estimation 
scheme when the number of copies coincides with the 
number of independent variables that parametrize the 
qudit state. 
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APPENDIX A 

The simplest LOCC approach is exemplified by the 
"one step adaptive" scheme 0|. Measurements are per- 
formed along just two different directions, and CLG is 
used 41]. The scheme has two stages and classical com- 
munication is used only once, in going from the first 
stage to the second. This is therefore, a very economical 
scheme from the practical and theoretical point of view. 
We here review the method and give a straightforward 
and comprehensive proof that it saturates the collective 
bound for large N. Given the economy of the scheme, 
this is not an obvious result at all. We focus only on the 
3D case, as the simpler 2D case can be worked out along 
the same lines. 

First stage: One performs A^o = N"" (0 < a < 1) 
measurements with a sensible estimator, in the sense of 
Sec. II VI and obtains an estimation Mq with a fidelity Fq: 

-Fo = 2^ / dn PniXo), (Al) 

xo 

where xo stands for the list of outcomes obtained in this 
first stage. 

Second Stage: At this point we use the CLG on the re- 
maining 2!N = N — N — Nq copies by measuring along 
two perpendiculars directions, u and t/, on the plane or- 
thogonal to Mo • In this basis the final guess can be writ- 
ten as 

M — Mo{xo) cosw -I- (ucosr -I- vsmr) sinuj. (A2) 

This parametrization ensures that M is unitary. The 
angles uj and t depend on the outcomes of this second 
stage, which are the frequencies a^J^, (1 — Q!i)3Nf, (i = 
u,v). The probabilities are given by p„(a) in H42|) . with 
n,a = n ■ u and Uy = ft ■ v. As argued above, we measure 
on the plane orthogonal to Mq because the two outcomes 
of each measurement have roughly the same probability, 
ai ~ 1/2, and they are most informative. It is convenient 
to define the two dimensional vector r, with components: 

Vi = 2ai — 1, i — u,v, (A3) 

which, on average, is close to 0. This vector gives an 
estimation of the projection of the signal Bloch vector 
n on the measurement plane {uv plane). Hence, co is 
expected to be small (Mq w M) and we make the ansatz 



^ = Wrl + rl, tanr=— , (A4) 

where the positive parameter A will be determined later. 

The final fidelity for a signal state n and outcomes 
{Xo,r) is 

r f l + n-M(xo,r) 
/n(xo, r) = 2 ' (^5) 



and the average fidelity F reads 

F^^/dn ^ + "-f^°'-'^ p„(Xo)p„(r1xo). (A6) 

Xo-X 

Notice that the probability of obtaining the outcome r, 
namely, Pn{r\xo) [= Pn{oi) in H42|l with i = u, v], is condi- 
tioned on Xo through the dependence of the second stage 
measurements on Mo(xo)- 

Since we will compute different averages over XOj ^ = 
(tutTv) and n, it is convenient to introduce the following 
notation: 

(/)o = E Mxo,r^Pnixo), (A7) 

Xo 

{f)r = E /"(X0,OPnMX0), (A8) 

f 

{f)n = I dnMxo,r^, (A9) 

and similarly for averages of other functions of xoi 
and n. We will denote composite averaging by simply 
combining subscripts (i.e. {{F)^)^ = {F)^^^). Therefore, 
we write 

F = {f)r,0,n = if). (AlO) 

Since F = {1 + A)/2, we have 

A = (n-M). (All) 

In the expansions that we perform below, we keep only 
the terms that contribute to the the fidelity up to order 
1/iV. Recalling that oj is expected to be small, it follows 
that 

cosu - l- — [rl + rl], (A12) 

sin cij COST = Ar„, (A13) 
sin w sin r = Ar„, (A14) 

to leading order. Therefore, the expectation value in 
(|Alip can be written as 

A = (^[l-^{rl + rl)}jn-M,'j^^ (A15) 

+ M{r u)r n.a + {ry)r Uy) . (A16) 

Since r„, ry (or equivalently a„, ay) are binomially dis- 
tributed, one readily sees that 

{n)r = rii, (A17) 

{r!)r = + 1^. (A18) 

We further recall that n is unitary and that {Mq, u, v} 
is an orthonormal basis, hence nf^ + = 1 — {ft ■ Mq)^, 
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and ljA16ll can be cast as 
A = A 



ft ■ Mo 



-A((n.M„f) +^(l--l^(n.A./on ^(A19) 



To compute the moments ((n • Mo)')o,n, we consider the 
angle d between n and Mg, which is also expected to be 
small. We have 



2Fo-l = (n- Afo) =(cos(5)n 

\ / 0,ri 

where we have used (jAl|l . Therefore 



(A20) 



= l-2q(l-^o). (A21) 
Now we plug this result back into IjAlQp to obtain 



F=(/) = l-(l-A)^(l-Fo)- 



1 - 4(1 - Fq) 
2H 



A^. (A22) 



Since the term (1 — A)^(l — Fq) is always positive, the 
maximum fidelity is obtained with the choice A = 1, and 
we are left with 



F = 1 - 



l-4(l-fo) 
2K 



(A23) 



Since the first estimation is asymptotically unbiased, 
1 — -fo vanishes for large Nq (i.e., for large N) and 

F ~ 1 - — . (A24) 
2K ^ ' 

Recalling that ^ {N - N°-)/2, we finally have 



F - 1 -- + ... . (A25) 
This concludes the proof. 



APPENDIX B 

In this appendix we prove that in the greedy scheme 
the optimal individual measurements on each copy are of 
von Neumann type. We sketch the proof for 2D states. 
The 3D case can be worked out along the same lines. 

The history of outcomes will be denoted, as usual, 
by X- Notice that here we consider general local mea- 
surements (local POVMs) with R outcomes, where R 
is possibly larger than two. Therefore x is a A^-digit 
integer number in base R: x = iN^N-i ■ ■ ■ H (ifc = 
0, 1, . . . , i? — 1). As in Sec. IIII Bl we use the notation 
Xk = ife*fc-i'''ii- A measurement on the fc-th copy 



is defined by a set of non-negative rank-one operators 
{0{Xk)}f-Jo = {0{ikXk-i) Ufc = 0, 1, . . . , i? - 1}, where 



0{xk) = c(xfc)[l + m(xfc) • a] 



(Bl) 



The non-negative constants c(xfc) and the vectors jn(xk) 
are subject to the constraints 



R-l 



E <xk) = 1, 



ik=0 

\m{xk)\ = 1, 



(B2) 

(B3) 
(B4) 



which ensures that 0{xk) > and 0{xk) — 1- Note 
that we allow c{xk) to be zero, thus taking into account 
the possibility that each local POVM may have a differ- 
ent number of outcomes without letting R depend on k. 

Assume we have measured all but the last copy and 
we wish to optimise the last measurement. Recall from 
Sec. in that the fidelity can be written as F = (1 -I- A)/2, 
where 



dn n Pnix)- 



(B5) 



To simplify the notation, let us define r = i^, rhr 
■m{rXN-i), and = c{rxN-i)- Then, 



Pnix) = PniXN-l) [Cr(l + ft ■ TO^)] 



and 



^ = E Ei^(^^^^-i)i= E ^(xiv-i)' 

Xn-1 r Xn-1 

where we have defined d{xN-i) as 



(B6) 
(B7) 



dn ft pn{xN~i) (l + ft- rhr) 



(B8) 

We further write V = V^(xa'-i) and define the symmetric 
positive matrix 



^ijiXN-i) = dn n^Uj p„(xJV-i)- 



Eq. IjBSp becomes 



d = Cr I -l- Arfir 



(B9) 



(BIO) 



(Hereafter the dependency on xn-i will be implicitly 
understood to simplify the notation). 

Our task is to maximize (jBlOjl . Introducing the La- 
grange multipliers A, 7, and cOr, the function we need to 
maximize is actually 



L = (i— AA — 7-f — UJr^r, 



(Bll) 
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where the constraints 

A = - 1, (B12) 

r 

r = ^^c^m^, (B13) 

r 

rn^ — 1 

nr = (B14) 

can be read off from HB2|) . IjBSp and (|B4|I . The factor two 
in the last expression is introduced for later convenience. 
Variations with respect to yield 

— = |y + ATO^|-7-m^-A = 0. (B15) 

OCr 

Notice that the points irir that satisfy this equation define 
an ellipse, £, with focus at —A~^V. Notice also that the 
parameter A at the maximum is the value of d in (|B10|) 
(just multiply (|B15I) by Cr and sum over r taking into 
account the constraints A = and F = 0). 

Finally, consider the variations of rfir in fRil^ . We 



obtain 

(Y -\- Am \ 
A^; 7 = LUrlTlr, (B16) 
\V + Arhr\ J 

which means that the vector inside the brackets is pro- 
portional to rhr- Note that condition fi^ = defines a 
unit circle, and the orthogonal vector to this curve is m^- 
So we only need to prove that the orthogonal vector at 
point rfir of the ellipse £ defined in IjBlSp has precisely 
this direction — this follows straightforwardly taking vari- 
ations with respect to rfir in ljB15|l . Therefore, the so- 
lution is given by the tangency points of ellipse £ and 
circle fir — 0. There are only two such points, they are 
in opposite directions, and all constraints and maximiza- 
tion equations are satisfied with ci_2 = 1/2. This proves 
that the optimal measurements in the greedy scheme are 
indeed von Neumann's • Notice that this is a stronger 
statement than it looks: local measurements with a larger 
number of outcomes will perform worse. 
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